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We introduce oo-dimensional versions of three common models of random hetero-polymers, in 
which both the polymer density and the density of the polymer-solvent mixture are finite. These 
solvable models give valuable insight into the problems related to the (quenched) average over the 
randomness in statistical mechanical models of proteins, without having to deal with the hard 
geometrical constraints occurring in finite dimensional models. Our exact solution, which is specific 
to the oo-dimensional case, is compared to the results obtained by a saddle-point analysis and by the 
grand ensemble approach, both of which can also be applied to models of finite dimension. We find, 
somewhat surprisingly, that the saddle-point analysis can lead to qualitatively incorrect results. 
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I. INTRODUCTION 

The study of random hetero-polymers is regarded as a 
natural first step towards a better understanding of the 
physics of proteins . The latter are believed to be spe- 
cial realisations of random hetero-polymers which have 
been selected by nature; before tackling these a-typical 
(selected) cases, one first investigates the properties of 
typical hetero-polymers. Several models of such systems 
have been proposed and studied (for an overview see e.g. 
[§,[§)■ Most involve a coarse-grained description of the 
polymer chain, where each monomer (which may consist 
of many atoms) is represented as a single bead, and ef- 
fective two-body interactions between monomers and/or 
solvent molecules. Since proteins are hetero-polymers of 
fixed composition, one is mostly interested in the case of 
quenched disorder. Yet, studying the case of annealed 
disorder can also be relevant, since it may describe a sit- 
uation in which the monomers are able to change their 
chemical properties (e.g. by undergoing chemical reac- 
tions or by exchanging charge) , and secondly (at a tech- 
nical level) since one can approximate the quenched re- 
sult by a series of annealed averages following the scheme 
of Morita §. 

The principal objective of this paper is to clarify the 
problems that occur when following a saddle point (SP) 
approach in dealing with the (quenched) randomness in 
those coarse grained models of random hetero-polymers 
which take into account the incompressibility of the 
solvent-monomer system. Although the behaviour of 
random hetero-polymers in finite dimensions will obvi- 
ously differ from those in high dimensions (due to the 
crucial role in the former of the chain constraint), we 
find that the study of infinite dimensional models pro- 
vides valuable insight into the appropriateness of some 
of the approximations and assumptions that are usually 
being made, as it disentangles the problems induced by 
the disorder from those generated by the configurational 



entropy. It is beyond the scope of this work to discuss 
the numerous variational approaches that have been in- 
troduced, often in combination with a saddle-point treat- 
ment. Neither do we want to go into details about pos- 
sible replica symmetry breaking (RSB) that may occur 
in some models. Although the dominance of the entropy 
in our present models does not permit real phase transi- 
tions at finite temperatures, we do observe phase coexis- 
tence and, especially in the case of annealed disorder, (re- 
entrant) cross-over between swollen and compact phases, 
which may indicate real (re-entrant) phase transitions in 
the corresponding finite dimensional models. 

This paper is organised as follows. In section II we 
introduce our models, and illustrate connections with re- 
lated models such as those describing matching problems 
and poly-dispersity. In sections III and IV we derive the 
(exact) solution for the case of discrete disorder, and for 
arbitrary disorder distributions (in terms of mesoscopic 
variables), respectively. In section V we study the same 
models with a (naieve) mean field approach, and point 
out the latter's weaknesses. In section VI we follow the 
grand ensemble approach of B, where the quenched av- 
erage is approximated by a series of annealed averages. 
Although hard to implement for real quenched disorder, 
this scheme may quite accurately describe polymers with 
so-called permuted disorder, where the monomers are al- 
lowed to migrate along the chain. Finally, in section VII, 
we discuss our results and present an outlook on future 
work. 



II. MODEL DEFINITIONS 

We consider large chains of random hetero-polymers. 
The position of each monomer i = 1, ... N along a chain 
is given by a vector fi- A function g({r}) will incorporate 
all the constraints on the possible configurations of the 
chain, such as self- avoidance, bond length, lattice con- 



straints etc., such that the free energy of the polymer 
can be written as 



*=- V ^ 



JY 



L[Tr) U(H)exp(-/3H(M)) 



(1) 



In the case of short range 2-body interactions, the inter- 
action Hamiltonian will be of the form 



H({f}) 



N 

E 



AijOfi ,r*j 



(2) 



If the interaction energies Ay G 5ft are drawn indepen- 
dently for each pair (i,j) with i < j, we have a model 
with bond-disorder. Alternatively we could choose for 
each monomer to carry a set of labels Ai S 5ft, specifying 
its electro-chemical properties, and define the interaction 
energies as fixed functions of the labels of the monomers 
involved, i.e. Ay = A(Aj,Aj), which would lead to a 
model with site disorder. 

Since (Q) cannot be calculated for all possible disorder 
realisations, and since the free energy is a self-averaging 
quantity (i.e. the average value is the typical value), 
one is interested in its average over the disorder, the 
so-called quenched free energy T q = (^}{a 4 -} ■ I 11 or_ 
der to investigate the influence of disorder in the inter- 
actions between the different monomers, without having 
to make additional approximations to deal with the con- 
figurational entropy, we consider the following simplified 
oo-dimensional model: 



• There are N monomers labeled i 
sites labeled r = 1, .., R. 



1,..,JV , and R 



• Every site is a neighboring site to every other site, 
and thus the chain constraint of the polymer is al- 
ways trivially satisfied. 

• Each site can contain up to n c monomers. This 
accounts for the incompressibility of the polymer- 
solvent system, and restricts the global density ac- 
cording to N/R < n c , with equality corresponding 
to the situation where all sites are completely filled 
with monomers. 

• Those positions at a given site which are not 
occupied by monomers, are occupied by solvent 
molecules. 

• Only those particles interact which find themselves 
in the same site. 




FIG. 1. Illustration of the oo-dimensional model, for 
n c — 3, with some of the allowed moves indicated with ar- 
rows. The upper configuration is one where the sites are not 
yet ordered; the lower configuration is obtained upon ordering 
the sites according to monomer occupation numbers. 

The problem of calculating the free energy has now been 
reduced to a relatively simple combinatorial one. The 
number of monomers at site r is denoted by n r . Hence, 
in terms of the notation of (II), g({r}) — 1 for all config- 
urations with n r < n c for all r, and otherwise: 



3(W) = II l 1 E S k,n r ({r}) 



(3) 



k>n c 



Note that we must always satisfy the following relations 



s j> 






= N 



N < V^ n r< < n c N 



(4) 



The problem is symmetric under arbitrary permutations 
of the sites. We now order these such that the first C nc N 
sites contain n c monomers, the next C„ c _i-/V contain n c — 1 
monomers, etc. (see FIG. [y). The last CqN sites contain 
no monomers. We define the total fraction of monomers 
that find themselves at a site which is occupied by I 
monomers as /; = ICi, I = 1, ..,n c . Thus 



£^ = £/, = i, £ c < 



a 



(5) 
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A monomer at a site occupied with n c monomers is said 
to be in the (maximally) compact state, a monomer at 
a site where there are no other monomers is said to be 
in a swollen state, and monomers which find themselves 
at sites with an intermediate number of monomers are 
in a semi-compact state. The macroscopic state of the 
polymer is characterised by the numbers {Ci, I — 0, .., n c } 
(which are proportional to the fractions of sites with a 
given monomer occupation number). The overall com- 
pactness of the polymer can be measured by a single pa- 
rameter D, with D = describing a maximally swollen 
configuration and D — 1 describing a maximally compact 
one: 



D = 



(1 



+ C ) 
(1-1/rac) 



(6) 



By varying a, the total monomer density and the fraction 
of monomers in (semi-)compact states at high tempera- 
tures can be made arbitrarily small. The state where all 
monomers are found at the same finite density n c is the 
high dimensional version of the compact globular phase 
for random hetero-polymers, which is often considered in 
the literature. Note that it is not appropriate to view 
the present models as coarse-grained high dimensional 
limit of lattice models, although the short range nature 
of the interactions is mimicked by representing small el- 
ementary volumes by sites and taking only (delta) inter- 
actions within this volume, ignoring the interactions be- 
tween these elementary volumes. For a true high dimen- 
sional limit, the number of sites in an elementary volume 
would increase exponentially with the dimension, while 
here it is kept finite. Hence, it is rather a geometry- free 
version of the finite dimensional continuum models that 
have been considered. 

We can now calculate typical quantities for boxes con- 
taining I monomers, provided we add the entropic contri- 
bution S'ord to the free energy which reflects our ordering 
of sites, given by the logarithm of the associated combi- 
natorial factor: 



log 



(R)\ 



n£o(<w 



Ar[alog(a)-£qiog(a)] , 



;=o 



(7) 



(using Stirling's formula) . As a consequence of the frac- 
tions {C{\ being a finite number of intensive macroscopic 
observables, we will be able to solve our models by mini- 
mizing the free energy with respect to the {C{\ (or, equiv- 
alently the {/;}). Alternatively, we can separate the to- 
tal entropy S into two contributions: the entropy S g of 
grouping the N monomers, 



log 



N\ 



nr=i(<=w (i\ ) c ' n 

Ar(i-^q)iog(iV)-iV(i-jrco-ivjrqiogO!C' i )) 



(8) 
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1=1 
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and, secondly, the residual entropy S p describing the re- 
maining freedom in positioning the monomers at the aN 
sites: 



log 



(aN)\ 



(9) 



(CoN)\ 
(a-Co)Nlog(N)+N[alog(a)-C log(C )-(a-C )} . 

Hence, the total entropy S is given by 

n 

S g + S p = Nlog(N)-N + N[a\og(a)-J2C l log(l\ Q)] 

1=0 

(10) 

Since S g contains a term proportional to Nlog(N), the 
entropy will always dominate the free energy, except at 



properly rescaled (very low) temperatures. This was al- 
ready recognized by Vannimenus et al. pi for so-called 
matching problems. In what follows we will systemati- 
cally omit all contributions to the entropy which are in- 
dependent of the configuration; the entropy may conse- 
quently become negative at low temperatures, and non- 
monotonicity of the free energy need no longer indicate 
a phase transition in the system. 

Our model can be interpreted as a hetero-polymer on a 
simplex in aN— 1 dimensions. It differs, though, from the 
infinite dimensional random Potts model, firstly through 
the fact that the (quenched) randomness is not in the 
background i.e. in the sites or in interactions coupling 
sites (in fact there are no couplings between different 
sites), but in the (mobile) monomers, and secondly since 
only a finite number of monomers are allowed to be at the 
same site. For the latter reason, and because the energy 
per site depends on the (number of) particles present 
at that site, it also differs from the back-gammon model 
of Ritort ftl. However, it shares the property that the 
entropy is not extensive. Our model can be seen as sim- 
ilar to those describing matching problems (with here 
monomers as the objects to be 'matched'). Note that in 
standard matching problems either the density is fixed 
(typically at n c = 2) ||M||, or properly scaled chem- 
ical potentials are introduced |J, both with the objec- 
tive to keep the non-extensive (permutational) part of 
the entropy constant. In contrast, in the present model 
the latter is achieved by limiting the number of avail- 
able sites to aN. Although the interesting behaviour of 
matching problems (such as freezing) only occurs at prop- 
erly rescaled (extremely low) temperatures, and a lower 
bound on the interaction distribution is needed to have a 
finite ground-state energy, we have not investigated this 
temperature regime in the present paper, but in anal- 
ogy with those papers on random hetero-polymer models 
whose assumptions and methods we wish to investigate, 
we have considered continuous disorder distributions of 
a Gaussian form. Finally, one can also view our model 
as one describing the coexistence of a large number (or 
infinite number, in the case of continuous disorder dis- 
tributions) of monomer species, i.e. as a poly-disperse 
mixture (see e.g. flC[| and references therein). 

What remains to fully specify our models is to make ex- 
plicit choices for the pair-interactions between monomers. 
In the present paper we consider the three most com- 
monly used interaction types: 

• In the random hydrophobic/hydrophilic model 
(RHM) jll}-|lj] a label A, is assigned to each 
monomer (with Xi > for hydrophilic and A^ < 
for hydrophobic monomers), and 



Ay = Xi + Xj (Xi i.i.d.r.v.) 



(11) 



1 N 

ft({r}) = ~ 2j[Aj + A.#ri,rv = ^2 Xi n, '> + COnSt 



i^j 



Note that this is only the configuration depen- 



dent term of the full interaction energy. Each con- 
tact of a monomer i with a solvent molecule con- 
tributes — Xi to the energy (hence the name hy- 
drophilicity) . The full Hamiltonian is thus given 
by H = - J2i J2 S ^i$ri,r s =- J2i M{n c - n n ) with 
the index s running over the solvent molecules. 

The random charge model (RCM) [Jl5|-|l7[ (or 'ran- 
dom sequence' model) describes inter-monomer 
Coulomb-like interactions. Again each monomer 
carries a label A^ (its 'charge'). The contribu- 
tion to the energy is proportional to the product 
of the charges of monomers in contact, such that 
equally signed charges either repel (e = 1) or at- 
tract (e = —1) each other: 



Ay = (\i\j (Xi i.i.d.r.v.) 



(12) 



N 



nw) = r Yl WiWi = « E A " + const 



'¥./ 



where A r = ^2 i \i5 r , ri is the total charge in box r, 
and e = ±1. Solvent molecules are assumed to be 
neutral. 

In the random interaction model (RIM) [|18|-p0] (or 
'random bond' model) the \N(N — 1) interactions 
Ajj between the pairs of monomers are themselves 
taken to be independent identically distributed ran- 
dom variables (i.i.d.r.v.), drawn from a probability 
distribution P(A), which may either be discrete or 
continuous: 



Xij i.i.d.r.v. (A ri = Ay) : 
1 N 



(13) 



i^J 



III. DISCRETE DISORDER: EXACT SOLUTIONS 



In this section we deal with the case where the interac- 
tions take discrete values, i.e. where the A, (for RHM and 
RCM) or the Ay (for RIM) have probability distributions 
of the form 



iV„ 



P(A) = ^p CT <5(A-A CT ) 



(14) 



a=l 



with finite N a . We define 'local states' as the monomer 
configurations that can occur at a given site (for site- 
disordered models these local states are fully charac- 
terised by the number of monomers of a given type 
present). Note that for finite n c and finite N a the number 
N\ s of possible local states is also finite. Furthermore, due 
to the nature of the Hamiltonian (0), the total energy is 



just the sum of the local energies in at the aN sites. We 
define the concentration c Sl of a local state si as the num- 
ber of times it occurs among the aN sites (si = 1, .., N\ s ) 
divided by N, and the number of monomers occupying 
a site in local state si by n Sl . We then always have to 
satisfy the normalisation requirements 



£« 



£' 



= i 



(15) 



A. RHM & RCM 

For the (site-disordered) RHM and RCM models, one can 
define the local state at a given site by a set of numbers 
{m a } (a = 1, .., No-), where each m? specifies the number 
of monomers present of type a. In this way we automat- 
ically take care of the invariance under permutations of 
monomers at the same site. The concentration ci m o\ of a 
local state {m a } is the number of times it occurs among 
the sites, divided by N. In addition to (^5|) we now have 
N a — 1 extra independent normalisation conditions, re- 
flecting conservation of monomer species: 



£ m<7 ' c {™°} =P° 

{m"} 



Va' = 2,...N a . (16) 



The total number of possible local states is given by 



^ = £ 



1=0 



N a +l-l\ (N a + n 



I 



(N a + n c 



n c +l J (n c +l)!(Ar CT -l)! ' 
(17) 



It is now straightforward to write down the state- 
dependent part of the free energy, for large N: 



T sd = (E-TS)/N= J2 Hm*} 



llog(r[[ m ^)c {m , } j 



E({m°}) + (18) 
(TV^oo), 



and the models are exactly solved at all temperatures. 
This expression is to be minimized with respect to the 
concentrations Cs m a\ (of which there is only a finite num- 
ber, even for N — > oo, and which play the role of order 
parameters), under the constraints (|15| , |l6| ). This proce- 
dure is in principle straightforward, however, since the 
number of order parameters (i.e. the number of Cr m <r\), 
as given by (|l7J), grows considerably with the number of 
monomer species _/V CT and with n c , we have restricted our- 
selves to the case n c = 3 and n a = 2 (bimodal disorder: 
Xi = ±1). This case turns out to already exhibit all the 
essential features of our models. The results of deter- 
mining the order parameters {c/ m <7\} by minimization of 
dig), are compared with numerical simulations in FIG. @ 
and FIG. |, for the RHM and the RCM respectively. 
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FIG. 2. Order parameters C{ n _ )n+ } as functions of tem- 
perature, for the Random Hydrophobic/Hydrophilic Model 
(RHM), with n c = 3, a = 1, Xi = ±1 and p(Xi = 1) = \. Full 
lines: theory. Dots: numerical simulations (N — 3000). 

We clearly observe the tendencies one expects. In the 
RHM model the low temperature local configurations 
are ones where either a site is fully filled with hydropho- 
bic monomers (which thereby avoid contact with solvent 
molecules) , 




FIG. 3. Order parameters C{ n _ : „ , j as functions of tem- 
perature, for the Random Charge Model (RCM), with n c — 3, 
a — 1, Xi — ±1 and p(Xi — ±1) = ~. Upper graph: e = 1 
(equal charges repel), lower graph: e = — 1 (equal charges 
attract). Full lines: theory. Dots: numerical simulations 
(N = 3000). 

or contains a single hydrophilic monomer (which then 
maximises the contacts with solvent). For the RCM we 
see that at low temperatures for e = 1 (equal charges re- 
pel) the system favors states where opposite charges pair, 
whereas for e = — 1 (equal charges attract) the preferred 
states are the ones where equally charged monomers max- 
imally group together. In all cases the agreement between 
theory and experiment is excellent. 



B. RIM 

In order to solve the Random Interaction Model (RIM) 
we have to follow a slightly different strategy. Firstly, 
because the local state at a site r is now defined by the 
number n r of monomers situated at r, in combination 
with the n r x n r traceless interaction matrix M r of the 
interaction energies between pairs of these monomers. 
The number of independent matrix elements of M r is 
n r (n r — 1)/2, each can take all of the N a different values, 
giving rise to N™ r[ - Ur >' different possible local matri- 
ces. Since we can freely permute the monomers at a 
given site, many of these matrices give rise to the same 
local state, and it is sufficient to select a prototype for 
each of the non-equivalent classes. Given a local inter- 
action matrix M, we can divide the monomers into N t 
classes of size m', t — I, ..,N t , such that M is invari- 
ant under all permutations within the same class, and 
not so for any permutation of two monomers from differ- 
ent classes. Secondly, because only a negligible fraction 
of the labels A^ are actually visible in the interactions, 
we cannot impose the equivalent of the condition in ([H]) , 
but instead we use the fact that the ensemble probability 
that a label A occurs, is given by P(X). Note that this 
equivalent to assuming that the annealed approximation 
of the quenched average is valid; in appendix B we will 
prove the validity of this assumption at finite tempera- 
tures. The state-dependent part of the free energy is now 
asymptotically given by 



T sd = {E- TS)/N = Y, C(„,M) 

(n,M) 



V 



■ log 



11"^! c (nM ) 



E(n, M) + (19) 

(N 



oo 



where m a M is the number of times the interaction a occurs 
one entry above the diagonal in the matrix M, and the 
model is exactly solved at finite temperatures. 




FIG. 4. Order parameters C{ n _ : „ , y and C\ as functions of 
temperature, for the Random Interaction Model (RIM), with 
n c — 3, a = 1, Xij = ±1 and p{Xij = ±1) = |. Full lines: 
theory. Dots: numerical simulations (N = 3000). 

The order parameters c/ n ,M) follow from minimisation of 



(pj|), subject to the constraints (15). For general n c and 
N a it is very cumbersome to determine the number of 
local states N\ s . Therefore, we have again restricted our- 
selves to the simple case n c — 3 (for which the (n, M) are 
fully determined by the numbers {m^}, whence c/ m <r \), 
N a = 2 with N\ s = 8; the result of comparing the predic- 
tions of the theory with numerical simulations is shown 
in FIG. | 



A. RHM & RCM 

For the RHM and the RCM, the solution involves only 
minor adaptations/generalisation of the one obtained for 
discrete disorder. The asymptotic state-dependent con- 
tribution to the free energy can formally be written as a 
functional over the above densities, where A; = {\i k , k — 
1, .., 1} are the labels of the I monomers present at a given 
site: 



-^sd = ^2 / d\i c(l 7 \i 
i=o •* 



E(l,Ai) + 4 log(H <#,£)) 



(N -c oo) (20) 



where dXi = (l~[fc=i dXi k ) and where E(l,Xi) is the con- 
tribution to the system's total energy from a site with 
I monomers characterised by label vector A/. The term 
— c(l, Xi) log(Z!) in the entropy reflects the fact that the 
c(l, Xi) are invariant under permutations of the Ai fc , k — 
1, ..,1. In the case of there being a non- vanishing prob- 
ability for the monomers to be identical, i.e. when the 
probability distribution of the labels contains delta peaks 
(as for discrete disorder) we have to add to the entropy 
the term — c(l, A/) log(JlA n ^0> wuere ^ runs over the dif- 
ferent values which the components of A; take, and where 
n\ = ^2 k=1 5\ i : a ■ The equilibrium values of the concen- 
trations c(l, A;) are to be determined via minimization of 
(pi]), subject to the constraints 



IV. CONTINUOUS DISORDER: EXACT 
SOLUTIONS 

In the general case of arbitrary and possibly continu- 
ous disorder distributions, we introduce densities c(Z, A;) 
of boxes with a specified number I of monomers and 
given labels A; (representing local attributes such as hy- 
drophilicities, charges or sub-matrices of interactions, de- 
pending on the model at hand). Since the For N — ► oo 
we assume the fraction of sites with I monomers and 
A; G [A;,A( + AA;] to be given by c(l,\i)AXi, where 
c(l,Xi) depends continuously on A; and is self-averaging. 
For these assumptions to be true the dimension of A; 
must be small compared to N. The c(l, Xi) are so-called 
mesoscopic variables. Note that our approach is similar 
to that of the sublattice magnetisations as introduced for 
spin models in ]21fl , and applies for as long as the nunrber 
of possible labels A; is sufficiently small to guarantee that 
the number of members in each class of momoners with 
prescribed labels is infinite, and that the fluctuations in 
the c(l, Xi) will not contribute to the free energy. 



l = J^ld, a = J^Q, 



1=1 



1=0 



P(X) = J2 / dXi c(l, A,) £ S(X ik - A) 

1 = 1 J \k=l / 



(21) 



where Q = J dAic(f, A/). Note that the first constraint of 
( |2l| ) need not be imposed separately, since it follows upon 
integrating the third constraint over A. Upon introducing 
corresponding Lagrange multipliers {B, V(X)} to enforce 
the constraints, we obtain our order parameter equations. 
Variation with respect to the c(Z, A;) yields 

c(l,\ l ) = ~exp(-[l + (3(E(l,\ l ) + B + J2V(^))}\- 

(22) 



To simplify notation we define (j)(X) 
S = cxp(-(l+/3/3)): 



B 



exp(-/3P(A)) and 



(U) = 7f n ^( A ^) zM-m,Xi)) 



(23) 



\k=l 



Next we have determine B and </>(A) by enforcing the 
constraints (|2l]), which leads us to the following non- 
linear integral equation for 4>{X) 



/lie j. I 



exp(-/?£(Z,A0) 



v j=i fc=i 



(/-I)! 



Ey n^*^*)) 



W=0 fc=l 



exp(-/?g(;,AQ) 
l\ 



(24) 
1 

= P(A) , 



We now work out our results for the RHM and RCM 
models separately: 

• In the case of the RHM, E{l,Xi) = ELi E i( l > A *J, 
such that ( p4J ) reduces to 

a<l>(\)(jrexp(-PE 1 (l,\)) G ^ ,l \!. 1 I X 



vi = l 



'y GW,l) 1 



(i-iy. 



= PW , 



(25) 



vi=0 



where £(/?,/) ee /dAi^(Ai) exp(-/3£i(Z, Ai)) 
This reduces to a discrete set of equa- 
tions upon inserting the trial solution 4>(X) = 
-P(A)Er=i d i exp(~PEi(l, A))] -1 . The d; are to be 
solved numerically from 






(26) 



• In the case of the RCM, £7(1, Ai) = |(EL=i K? 
and using a Hubbard-Stratonovich transformation, 
we obtain 



a<j){X) ( / Dz exp(\/e/3Az) ]T 



/C(/3,z 



,(*-!) 



/=1 



/^E 

v -7 z=o 



-£(/?,*)' 



/! 



(/-I)! 



P(A) , (27) 



in which Dz ee efecxp(— z 2 /2)/\* / 2tt, and K,(/3,z) = 
f dXi<fr(Xi) exp(V— e(iX\z) . We are not able 
to solve ( pTJ ) for general P(A) and/or variable 
monomer density. 
However, for fixed monomer density, i.e. 

c{l,Xi) = ^,n (nLi^(Aij)exp(-/3^,A;)), in 
combination with the choice 

P(X) = v /^cxp(-t;(A-Ao)/2), we obtain 



/o(™=- 1 )/«c(a+n c £ / S) 1 / n <= 
*W = V ^ X 



(28) 



x exp(--A 2 + Ao(a+n c e/3)A - Ao(a+n c £/3)/2), 
(v— n c e/?) + ^{v-n c e(i) 2 + 4z;(ra c -l)e/3 



The results of solving these equations are compared 
with numerical simulations in FIG. pi for Gaussian 
distributed charges. 



The results of these calculations are compared with 
numerical simulations in FIG. g| for Gaussian dis- 
tributed hydrophilicities/hydrophobicities. 
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FIG. 5. Order parameters C; as functions of temperature, 
for the Random Hydrophobic/Hydrophilic Model (RHM), 
with n c = 3, a = 1, and P(X) - AT(0, 1). Full lines: the- 
ory. Dots: numerical simulations (TV = 3000). 
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FIG. 6. Contour plots of concentrations C2(Ai,A2) for 
the RCM with n c = 2, P(A) = TV(0, 1), a = §, T = 0.2 
and e = ±1. Upper graphs: e = 1 (equal charges repel). 
Lower graphs: e = — 1 (equal charges attract). Left graphs: 
theoretical predictions. Right graphs: numerical simulations 
(N = 3000). 



FIG. 7. Order parameters Ci as functions of tempera- 
ture, for the Random Interaction Model (RIM), with n c = 3, 
a — 1, and P(A) = TV(0, 1). Full lines: theory. Dots: nu- 
merical simulations (N — 4500). Deviations are due to finite 
size effects, which are more profound in the RIM (which has 
bond disorder) than in the RHM and RCM (which have site 
disorder) . 



B. RIM 

In the case of the RIM, we can divide the monomers in 
classes {^,A/} where / is the number of monomers and 
A; = {Xi kt i ml k < m = 1,..,^}. As in the discrete case, 
we cannot impose the equivalent of the P(X) condition 
in (pi]), but at finite temperatures the annealed approx- 
imation is exact (see appendix B), and the ensemble 
probability that the label vector A; occurs, is given by 

Ilfe<m P(^ik,i m )- Hence, the state-dependent part of free 
energy per monomer is 



T* 



1=0 



d\i c{l, Xi) 



^>^n^ 



(29) 



where c(l,Xi) is the density of states. The equilibrium 
distribution c*(l,Xi) is determined by minimization of 
©, yielding 



V. CONTINUOUS DISORDER: NAYVE MEAN 
FIELD SOLUTION 

We now compare the solutions of our models, as ob- 
tained directly in the previous section, to the result of 
following the standard procedure: which is to calculate 
the typical quenched free energy via the replica method: 



-I3F) = lim 



1 



n— >0 



Z n = Tr exp {-(3H) , (31) 



where Z n is the n-replicated partition sum and Tr{ r aj 
denotes the trace over all configurations of the repli- 
cas with the constraint that each box contains at most 
n c monomers. We now perform the average over the 
disorder, and, in analogy with the continuum models 
considered in the literature, we enforce the conditions 
n r — J2i$r,r? and qfy = J2i^r,rfS r , r b by introducing 
the conjugate variables h% and qf^,. This yields the fol- 
lowing expression for the replicated partition sum: 



c(i,Xi)= \[ P(X lk 



N l B 



exp(-0E(l,\i)) (30) 



i<k=l 



zn ) = E / D w> E /^Mcxp u$>?t# 



where TV and B have to be chosen such that the first two 
constraints in (|2l| ) are satisfied. The results of solving 
the remaining equations numerically are compared with 
the results of carrying out numerical simulations in FIG. 



0- 




"* E E «£ ^ + 91 W , qfr- ) + .92 {K , qf r , ) ] (32) 
r,r' a<b 



where gi is given by 



E lo s (A expH E ^ - * E c ; 

l \ ' a a<b 



zab 

a b 



(33) 



and gi is given by 



-/3Ao(Ear< 2 " nNn e ) + &£ £ or , 6r , n a r M arM ,n b r , 



for the RHM 



■ eJ ^Y,ar, b r' <^V< ~ \ log(det(i? or>6r 0) 





for the RCM 


~ 0Xo (V n a2 - 

2 \Z-<ar "r 


_ nN) + £f^ arMi{MarM ,f 




for the RIM 




(34) 



where 



M art br' = n^8 r y6 a fi + Qrr' (1 — ^a,fc) , 



(35) 



(l+e(3X 2 n a r )S ry S a . b + ^-«#(1 - S a , b ) 



Ra 



The MF treatment up to equation (133) is perfectly valid. 
At this stage, however, it is important to note that the 
?i, a € {0, 1, ..,n c } and (J""*, e {0, 1, ..,n°n£/} are discrete 
microscopic variables, rather than self-averaging order 
parameters (compare to e.g. spin-glass theory |22]). Of 
these at finite density, there are O(N) and 0(N 2 ) re- 
spectively, while in the finite dimensional analogon, the 
volume occupied by the polymer is O(N). Note, further- 
more, that for any n c ~ 0(N V ) with v < 1 (in our case 
f = 0), even in the most compact configuration the vast 
majority of the qfy will have to be zero. In appendix B, 
we will show that at finite T the non-zero q^,'s will be 1. 
Having a finite fraction of the non-zero q^,'s > 1, leads 
to a decrease in the entropy of order Nlog(N), while 
the energetic gain is at most extensive (this is specific to 
our oo-dimensional model) . Hence, the integrals over the 
n° and q^, cannot be treated as SP integrals (there is 
no large parameter N). Nevertheless, since the integrals 
cannot be performed explicitly, one could consider SP 
integration as a first approximation. We here find that 
a SP treatment of the h" leads to at least qualitatively 
correct results, which may explain the relative success of 
MF models for homo-polymers |23] . In many studies the 
n° are kept constant anyway, as one focuses on the prop- 
erties of the compact phase, eliminating the SP-problem 
altogether. We will show, however, that a (naive) saddle- 
point treatment of the qfy may lead to qualitatively in- 
correct results. Several variational approaches have been 
proposed to deal with this problem fl^j - |20[| . The qual- 
ity of a variational approach depends on the quality of 
the trial function(s), and there is always the risk that 
the true physical behaviour is not included. To develop 
a self-consistent MF theory for random hetero-polymers, 
it may be necessary to find a good set of "true" order 
parameters. It may well be that the origin of some of 
the incongruences between the results obtained by ana- 
lytical methods for continuum models and those found 
numerically for lattice models pLIlTj , lay in the applica- 
tion of the saddle-point method for strongly fluctuating 



"(/ b 



quantities such as the q„, and q 



A. Mean Field solution of the RHM 

In this model, one can completely avoid the q^, (and 
hence the q^,), by using the equality 



5>x Q X' = E<?<> 



(36) 



Due to the fact that the g"*, appear linearly in gi (34), 
the introduction of the q^, and the solution of the SP 



equations, replaces the g"£, by their average value. As 
they are summed over, one obtains the same (exact) re- 
sult as by using (|3^) directly. Since only one-replica- 
index parameters remain, no replica symmetry break- 
ing (RSB) will occur. After some rewriting, using the 
Hubbard-Stratonovich transformation, replica symmetry 
(RS), and keeping only leading terms in n, we obtain for 
.92 



/3 2 A 2 
nN J Dzlog[ Tr exp(— i h r > — n 2 ,+[3\zn r ') 



N 

It 
r'=l 



(37) 

Similarly, the SP equations with respect to the n", can 
be written 



n r = N Dz 



exp(—ih r — '--^—n 2 + (3\zn r ) 



'Tr^ =1 exp(-m r / - ^n 2 , + /3\zn r >) ' 

(38) 

which have to interpreted as equations for the n". As 
explained before, for any finite n c , the most general so- 
lution for the n r can be written down in terms of a finite 
number of parameters Ci . With the definitions 

i=i i=i 

h{z,l) = -ihi~l(3 2 \ 2 l 2 +l3\zl, 

Af(z) = Y / C l eMHz,l)), 

1=0 



(39) 



and adding the entropic correction ([?]), we obtain the 
following expression for the state-dependent part of the 
free energy per particle in the SP approximation 

BX 2 1 ( n " 

Fsd =-A (n c -A 2 ) ;— A 3 - - i V ]lCihi+ 

Dztog(W(z))-X;C,log(a) , (40) 



z=o 



With the constraint Ai = 1, the equations (Bq) reduce 
to n c + 1 non-equivalent equations for the hi , and n c — 1 
equations for the Ci (I = 2, ..,n c ): 



1= / Dzexp(h(z,l))/M(z) , 



/3A 2 



(J-l) 



(41) 



-log(C 1 ) + ^ T ^log(C ) I = . (42) 

From (pl|), one sees that for all temperatures —i h — 
— oo, and that only differences between the hi are impor- 
tant. Thus, the hi are defined up to a constant, which 



drops out of the final expression for the free energy. At 
any temperature, the free energy (|40j) depends of a fi- 
nite number of parameters C„ c , .., Ci and n„ c , ..,h<z (C\ is 
fixed by normalisation and h\ fixes an irrelevant gauge). 
The SP values of these parameters have to be obtained 
numerically from (|4l| ) and (^2|). The results we obtain, 
are qualitatively correct, but as explained in appendix A 
one can do better by adding the high temperature cor- 
rection 



l\ 



li^cji) 



a 



(43) 



1=2 



to the free energy (BOI) (and make the corresponding cor- 
rection in (ff2)), to obtain even quantitatively excellent 
results. 

To show that we obtain the correct low temperature 
behaviour, we solve equations ([Fy,^) (with the cor- 
rection) in the limit j3 — ► oo analytically. It can be 
shown that in that limit the Ci for intermediate values 
1 < I < m, vanish. Hence, we immediately start with 
this simple ansatz: C„ c A boxes contain n c monomers, 
C\N = (1 — n c C nr )N boxes contain 1 monomer, and 
CqN = (a— l+(n c ~ l)C nc )N boxes contain no monomers. 
We call this phase separation, because the monomers are 
either in the maximally compact or in the swollen phase. 
In the limit /3 — > oo only the leading exponent survives, 
such that 



ih c = -\0 1 X 2 n\ 
i hx = -±/? 2 A 2 

where A is defined via 



- /3\An c 
(3XA 



C 

c 



f°° A 

fn c = n c C nc = Dz = |erfc(-p) 

J A ' v2 



(44) 



(45) 



Inserting all this into (HOT), we obtain for the ground state 
energy 

exp (-A* 2 /2) 
V2^ 



(46) 



E (A*) = -(n c -l) A (l-/*) + A 



= -K-l)/ dXXP(\), 
Jo 

where A* is the value that maximizes 



A* = 



A ' 



K. = l-fc(^) 



(47) 



This result makes perfect sense: given Ao and A, the frac- 
tion of monomers that are negative, i.e. hydrophobic, is 
exactly /* . For energetic reasons, these monomers want 
to be in the maximally compact state (n ri — n c ), while 
the others are hydrophilic and want to be maximally 
swollen (n Ti =1). Note that the free energy can become 
arbitrarily negative for n c — > oo. At high temperature 
we know the exact solution, and at low temperature the 



energetic term dominates and is calculated exactly by 
SP integration, because (3 acts as the large parameter. 
Numerical analysis shows that also at intermediate tem- 
peratures the obtained curves are indistinguishable from 
the exact solution (FIG. 0). 

We conclude that the RHM is probably the only model 
in which the SP method leads to qualitatively correct re- 
sults. This could be anticipated by the observation that 
using (p|), the qfy could have been avoided altogether. 



B. Mean Field Solution of the RCM 

Before turning to the technical analysis of the model, 
we anticipate what to expect for the low temperature 
behaviour. 

• First, we consider the case e = +1, such that equal 
charges repel each other. When the average charge 
is (Ao = 0), we expect the monomers to group 
together into boxes such that the total charge in 
each box goes to zero, because the energy is ex- 
actly the sum of the squares of the local charges A r 
(||). When A ^ (take A > without loss of 
generality), however, there is a total charge AAo, a 
total negative charge NX n , and we define A by 



dXiP(Xi)(Xi - A) = -X v 



(48) 



The ground state energy per monomer Eq is then 
limited from below by 



r-A />oo 

E > dX l P{X l )X 2 l + A 2 dX l P{X l 

JO J A 



)• (49) 



In this case the most positive charges are compen- 
sated by negative ones to become A, until all neg- 
ative charges are used up (with a finite number of 
charges in each box, it is not guaranteed that this 
can indeed be done). The remaining (excess) posi- 
tive charges will stay alone in a box. A closer look 
at the expression for g\ (g4|) shows that it is well de- 
fined at all temperatures for e = +1 (equal charges 
repel). 

• For the case e = — 1, we expect the monomers to 
group together in order to maximize |A r | for all the 
occupied boxes, and the ground state energy per 
monomer is hence limited by 



En>- 



dXiP(Xi)(n c Xif 



(50) 



A closer look at the expression for g\ ( pif ) reveals 
that there is a critical (3 C (dependent on n c , A 2 , but 
independent of n) for any finite n, such that for 
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(3 > j3 c the matrix R ar ,br' is no longer positive def- 
inite. This is of course an artifact of the replica 
method, which first calculates the annealed aver- 
age of n copies of the system and then takes the 
limit n — > 0. For any finite n > 0, there is a 
phase transition at f3 c after which the variance of 
the Xi diverges. The central problem is the fact 
that this phase transition makes the result for fi- 
nite n useless to obtain information about the sys- 
tem in the limit n — > 0. Therefore, the replica 
method is not applicable for low temperature re- 
sults with e = — 1. This problem should not oc- 
cur for distributions P(A) with the property that 
lim A ^ ±oc P(A) exp(aA 2 ) -* , Va € JR. 

We now follow the same procedure as in the previous 
section. The SP equations for the qfy yield 



i q. 



R 2 \ 2 \ 2 

ab _ P A A *rq jsjb 



N3 = 



(1 + £/3A 2 <) 



(51) 



such that (at least in the SP approximation), the two- 
replica parameters reduce to a product of one-replica or- 
der parameters, and no RSB will occur. As explained 
before, for any finite n c the most general solution for the 
n r can be written down in terms of a finite number of 
parameters C/. With the definitions (B9|) and 



-i m 



h(z,l) 

n. 



R 2 \ 2 \ 2 \ 2 

^^Nf+pX^zNt, (52) 



/ 



1=1 



(1 + e/3X 2 l) ' 



(53) 



after some rewriting, using result (pl|), RS, the entropic 
correction ([43] ) and the Hubbard-Stratonovich transfor- 
mation, we obtain for the state-dependent part of free 
energy 



•7"sd = 



eXl 



1 [ "■ r 

Y^lCiNi-- iJ2lCini+ / Dz\og(M(z)y 



L 1=1 



£aiog(S + ±i>io g ' v ' 

1=0 1=1 



(54) 



The hi have to be determined from the fixed point equa- 
tions 



1= Dz 



cxp(/i(z, I)) 
MM 



(55) 



and we obtain the result that the qjk — 0(1/N). Again, 
the qjk are replaced by their average values, but in con- 
trast to the RHM, this has severe consequences. In the 
low temperature limit we find that Ni ~ l/e/3A 2 , and 
— ihi = log(Z) VZ, such that we obtain 



£o=0 



(56) 



for the ground state energy. Because of (ff9[), this result 
can obviously not be correct for Aq ^ 0. 



C. Mean Field Solution of the RIM 

Following a similar procedure for the RIM, it is not possi- 
ble to decouple the "saddle-point" equations with respect 
to the q^r' f° r the different replica's, because the qf^, are 
not the product of single replica quantities as was the 
case for the RHM and RCM. Therefore, extra approxima- 
tions, such as variational approaches, are needed. Never- 
theless, one can see that in a SP treatment, the <A will 
again be replaced by their average values. From d34) one 
easily sees that the g"£, dependent term is O(N) when 
q^r> € {0, l,..,n c }, but becomes 0(1) when we replace 
them by an averaged value. 

In appendix B, we show that at any finite tempera- 
ture the dominant contribution comes from the entries 
qf^,i = 1 • This alternatively explains why the q depen- 
dent part of ( p4| ) for the RIM will be a constant and the 
quenched average reduces to the annealed one at finite 
T. Furthermore, we introduce a set of true order param- 
eters which can be treated using the SP method. We 
show that a finite fraction of q^, > 1 implies a reduction 
of the entropy of order (TV log (TV)), while the energetic 
gain is at most extensive. Only at reduced temperatures 
T" = T/y/log(N), the system starts deviating from the 
annealed behaviour, because the energetic part of the free 
energy can compete with the over-extensive part of the 
entropy. We only signal the onset of this phenomenon. 
A detailed study of the low temperature behaviour is be- 
yond the scope of this present paper, and has been done 
for the very similar matching problems (|,||||. Hence, 
due to the invariance of the model for permutations of 
the monomers (which is definitely not true for finite di- 
mensional models!), it turns out that the simple annealed 
average is equivalent to the quenched one (at finite tem- 
peratures). We will return to this issue in the next section 
on annealed averages. 

Nevertheless, even at finite temperatures deviations from 
the annealed behaviour are observed in the simulations 
(see FIG. 6). These are finite size effects, because lim- 
itations in computer memory do not allow system sizes 
large enough to make 1/ y/log(N) effects negligible. 



VI. CONTINUOUS DISORDER: GRAND 
ENSEMBLE APPROACH 

An alternative approximation scheme is the Grand En- 
semble Approach. It consists in approximating the 
quenched average by a series of constrained annealed av- 
erages, and is originally due to Morita 0J. For a re- 
view and detailed description, we refer to |24| . Since the 
chain constraint is negligible in our simple model, we ex- 
pect good results upon fixing only the moments of the 
overall probability distribution of the randomness. In 
finite dimensional models, however, this does not pre- 
vent the random variables to choose their most favorable 
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position along the chain. Therefore, one may have to en- 
force conditions on the correlations along the chain, in 
order to obtain a good approximation for the quenched 
free energy [|14| . Apart from being an approximation for 
the quenched case, the (constrained) annealed average 
also describes a different physical situation where the 
monomers are allowed to change their properties on the 
same time scale as their configuration. 
Using the same notation as in the previous section, we 
find that the equations for the hi are temperature inde- 
pendent, and the same as in the quenched case at infinite 
temperature. Therefore, we can correct them exactly to 
yield the entropic term 



^a 1 =-A (n c -A 2 )-i/3 2 A 2 (A 3 



£c,bg(J!Ci) 



(57) 



1=0 



A. Annealed Averages for the RHM 

First, we calculate the average of the partition sum di- 
rectly 



exp(-/3iV^ aQ ) = (Z) , 
which yields the annealed free energy 

T aa =-AoK-A 2 )-i/3A 2 (A 3 -2n c A 2 +n 2 



P 



(58) 



(59) 



At intermediate temperatures, the SP equations for the 
fi,l = 2,.. 1 n c have to be solved numerically. In the low 
temperature limit however, they can be solved analyti- 
cally with the phase separation ansatz (/j = 1 — f nc ), 
and yield for the ground state energy 



K° = (W C *)K-1) [A + i/?A 2 K-l)] 



(60) 



The minimum occurs clearly for /* = 0, such that all the 
monomers will typically be hydrophilic and alone in their 
boxes. 

In FIG. |S| a, we observe re-entrant cross-over from a 
swollen via a compact back to a swollen phase for suf- 
ficiently negative Ao , indicating real re-entrant phase 
transitions in models with an extensive entropy E3]. 

Secondly, we calculate the average of the partition sum 
imposing the constraint that the average of the Xi is equal 
to that of the quenched one 



cxp(-/3NF ai ) ee (Z (H^Ai-AoTV)). 



(61) 



We introduce the conjugate variable M to enforce the 
constraint on the average. This is a genuine self- 
averaging order parameter with with SP value M — 
/3(A 2 — n c ). The free energy is then given by 



X 
P 



(62) 



At intermediate temperatures, the SP equations for the 
fi,l = 2, ..,?i c have to be solved numerically. In the low 
temperature limit, however, they can be solved analyti- 
cally with the phase separation ansatz (/i = 1 — f„ c ) to 
yield for the ground state energy 



K 1 =-AoK-l) + i/3A 2 (n c -l) 2 (/„ c -/ 2 ) 



(63) 



The minimum is at /* = i, such that half of the 
monomers will be hydrophobic and in a completely filled 

If 

a) 
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FIG. 8. a,b,c The order parameters Ci as functions 
of temperature, for the Random Hydrophilic/Hydrophobic 
model (RHM), with n c = 3, a = 1, and P(A) = Af(-5, 1): a) 
Simple annealed, b) annealed with fixed mean, c) annealed 
with fixed mean and variance. 
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box, while the other half will be hydrophilic and alone in 
their box. In FIG. || b, we observe cross-over to phase 
separation at low temperatures, indicating a real phase 
transition in models with an extensive entropy fl4J| . 
Thirdly, we impose the constraints that both the average 
and the variance of the Xi must be equal to the quenched 
ones. 



exp {-(3NT a 



Z<5(^(A l -A ) ( 5(^((A i -Ao) 2 -A 2 ) 



B. Annealed Averages for the RCM 

In order to describe both the cases e — ±1 at all tem- 
peratures, we have limited ourselves to the case where 
both the average and the variance of the disorder are 
constrained. Introducing the conjugate variables M and 
5 respectively, we obtain the following expression for the 
free energy 



(64) 

To impose these constraints, we introduce the conjugates 
variables M and 5. Their SP values are M = ^0- + 
(3(A 2 -n c ), and 5 = ±(-1 + v /l+4A 2 /3 2 (A 3 -A 2 )). The 
free energy is then given by 



T a 



5-log(l+5)- 



\ 2 M 2 



^^%% 



(1+5) l=0 

I 2 



E^ 1 ^ 1 - 



f+fiO' S 'd+® 



/3 



e/3X 2 l 
(1+5) 

(68) 



T a2 =-\o(n c -A 2 )- 



log(l + 5) 
2/3 



j5 



(i\ 2 



2(1 + 5) 



(A 3 -A 2 )+^ 



(65) 



At intermediate temperatures, the SP equations for the 
fi,l = 2, ..,n c have to be solved numerically. In the low 
temperature limit, however, they can be solved analyti- 
cally with the phase separation ansatz (/i = 1 — f„ c ) to 
yield for the ground state energy 



E a ' 2 



A (n c -A 2 )-AA/A 



A 2 



The 



minimum occur 



s for 



/c=|(l- 



v/A^+Ai 



:), 



(66) 



(67) 



which is already very close to the quenched result (47). 
In FIG. |[ c we observe that the cross over type of be- 
haviour in the case of very negative Ao has vanished as it 
should for the quenched case. In FIG. £)| we compare ap- 
proximation (pq) with quenched numerical simulations. 
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FIG. 9. The order parameters Ci as a function of tem- 
perature, for the Random Hydrophilic/hydrophobic Model 
(RHM) with n c = 3, a = 1, and P(A) = 7V(0, 1). Full lines: 
annealed theory with fixed mean and variance. Dots: nu- 
merical (quenched) simulations (iV = 3000). The results are 
qualitatively correct. 




This approximation is compared with the results of 
Qjienched numerical simulations in FIG. 10.a,b. One ob- 
j@rves that this leads to at least qualitatively correct re- 
sults. 




FIG. 10. a,b The order parameters Ci as function of 
temperature, for the Random Charge Model (RCM) with 
n c = 3, a - 1, e = ±1 resp., and P(X) = 7V(0, 1). Full 
lines: annealed theory with fixed mean and variance. Dots: 
(quenched) numerical simulations (iV = 3000). The results 
are qualitatively correct. 
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C. Annealed Averages for the RIM 



Due to the fact that there are N(N—l)/2 disorder param- 
eters Xij , fixing any overall moment of their probability 
distribution does not have any effect. One can e.g. check 
that the leading order of the SP values of M and S is 
N(N—l)/2, while the sub-leading order is 0(1), and the 
same holds for higher order moments. This is another 
indication that the simple annealed average is exact (see 
FIG. @). 



T — T - 

J q — J ao 



2(^0 



f3X 2 



)A 2 



X 

' p 



(69) 



The divergence of the energy at low temperature is due 
to the number of disorder parameters, such that rare, 
energetically very favorable combinations of monomers 
can be formed, to yield arbitrarily low energies. This 
would of course not be true for a distribution P(X) with 
a support that is bounded from below. 



VII. CONCLUSIONS 

We have studied a simple model for random hetero- 
polymcrs in oo-dimensions, the finite dimensional ana- 
logues of which had been studied using various ap- 
proaches Q|- |2(J. The simplicity and transparency of 
the model has allowed us to formulate an exact solu- 
tion in terms of true order parameters for discrete disor- 
der, and in terms of self-averaging mesoscopic variables 
for continuous disorder. Although the behaviour of fi- 
nite dimensional models will obviously be very different, 
due to the crucial role in the latter of the chain con- 
straint, our model gives valuable insight into the prob- 
lems inherent in the saddle point approximation for the 
parameters that occur naturally in a mean field (replica) 
treatment. Whereas qualitatively correct for the random 
hydrophilic-hydrophobic model, the results for the ran- 
dom charge model are found to be even qualitatively in- 
correct. For the random interaction model additional as- 
sumptions are needed, but in our oo-dimensional model 
at finite temperatures, the simple annealed average is 
found to coincide with the quenched result. Variational 
approaches have been successful in capturing certain as- 
pects of the behaviour of random hetero-polymers, but it 
may well be that the origin of some of the incongruences 
between the results obtained by analytical methods for 
continuum models and those found numerically for lattice 
models Q,|7| , lay in the undue application of the saddle 
point approximation for strongly fluctuating quantities. 
It is not our goal to diminish previous work which has 
led to many valuable insights, but to draw the reader's 
attention to the inherent risks of the saddle point approx- 
imation in random hetero-polymers. 

The Grand Ensemble Approach |i 24 is a different 
powerful tool to describe the case of permuted disorder, 
but it is technically very hard to impose independence of 



the disorder along the sequence. Hence, a good detailed 
description of the quenched case based on the Grand En- 
semble Approach is as yet still out of reach in finite di- 
mensional models [H . 

We have established links between our model, and the 
Backgammon model ||, models for polydispersity p0[ , 
and matching problems P,[7hP[. The latter in particular 
suggests that in order to develop a self-consistent mean 
field theory for random hetero-polymers, it may be nec- 
essary to find a good set of "true" order parameters (see 
appendix B). 

The protein folding problem can be viewed as a 
geometrically constrained matching problem for the 
monomers. Hence, new insights may be gained using the 
techniques developed for the matching problem HQ-g], 
imposing some weak (geometrical) constraints that can 
be treated analytically. The low temperature phase for 
the random interaction model certainly needs further in- 
spection. It can be viewed as a generalisation of the 
existing results for matching problems, and may be qual- 
itatively different due to the non-bounded nature of the 
ground state energy. Although absent at finite densities, 
phase transitions may occur when the density is allowed 
to become arbitrarily high. 

Finally, the study of the dynamics of this model both 
at finite and at high density is currently in progress and 
will be published elsewhere. 

I am grateful to A.C.C. Coolen and R. Kiihn for valu- 
able suggestions. 
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APPENDIX A: CORRECTIONS TO THE 
SADDLE-POINT 

The first correction to the SP for the free energy is easily 
calculated, and given by 



1 1 ™ c 

-logdet(tf*)~— £C,log(0 



(Al) 



1=2 



where H* is the Hessian of —j3^ q with respect to the n r 
taken in the SP (Ew . As we will see now, this correction 
is not enough to obtain the correct entropy at high tem- 
perature. 

In the limit j3 — ► equations ( |ll|) simplify to —ihi = 
log(Z), and the entropy is given by 
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which is not correct! From a simple box counting ar- 
gument one can show that the non-constant part of the 
entropy in the high temperature limit is given by 



£ Q log(Cz) - E Ci log(H) = - £ d log(Z! d) 



around the SP contribute to the same order in N . In- 
stead, at all temperatures, we add the high temperature 
correction 



1 " c _ 
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(A4) 



to the free energy (Efl) (and the corresponding correction 
to (^2|)). In this way, we are at least ensured to obtain 
the exact result in the high temperature limit. 



APPENDIX B: VALIDITY OF THE ANNEALED 
APPROXIMATION FOR THE RIM. 



For simplicity and for comparison with the so-called 
matching problem f),p]U, we take n c — 2, a = -^- = |- 
We introduce the fol owing order parameters 
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which have the clear physical meaning of being the frac- 
tion of monomers that are coupled to the same partner 
in all replicas ai,..,Ofe. The quenched free energy per 
particle (see (B2|)) can then be rewritten as 
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where A^ = n c (for a — —). S m is the entropy of 
replica m given the configuration of replicas 1, .., m— 1 and 
given the mutual overlaps d ai ,..,a k , 1 < a± < .. < ak <m 
To evaluate this entropy we assume replica symmetry 
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,n, Voi < .. < a k . 



The particles of the m-th replica can be divided into the 
following non-equivalent groups: 
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The first term comes from the permutation of the boxes, 
while the second one comes from the permutation s o f 
monomers within the same box. To obtain result ( |A3| ) 
analytically, one should in principle perform all the in- 
tegrations over the n r exactly, or equivalently, consider 
the whole expansion around the SP. This is an unfeasible 
task, since at all temperatures all orders in the expansion 



where d\ = 1 and ^2 



TTl — 1 \ 

k=\ \ k-^ I Sm ' k = ■*" N° te that k 
indicates the number of replicas (including replica m) in 
which the couples are equal. Hence, the couples in the 
groups with fc > 1 are fixed, because they are equal to 
at least one couple in replicas 1, ..m— 1. The remaining 
monomers in s m ,i are still free to form couples with the 
only restriction that they are not equal to any of the 
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existing couples in replicas 1, ..m— 1. Hence, the entropy 
of replica m is composed of both the possible divisions 
in groups, and the possible couplings in the last group of 
size Ns m ^\. The total number of ways in which R 3> 1 
particles can be coupled is denoted by Ir, while I R is the 
number of ways in which R particles can be coupled such 
that none of the couples is equal to any of the couples the 
other replicas. The number of already existing couples 
in the other replicas is c-f with 1 <c< (to — 1). Thus, 
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S R ~R/2(log(R)-l), 
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Ir <Ir< Ir-—Ir-2, S r ~ S R -OQ.og(R)). 



Using (B3)-(B4), the entropy <S m of replica m is given by 



are no longer valid, since the s m k are no longer of order 

Although our reasoning remains qualitatively valid for 
n c > 2 and a > — , the details become extremely com- 
plicated as the necessary set of order parameters will be 
given by 
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1 <ai < .. <ak <n , fc = 2,..,?i, £ = 2,..,n c . 

The latter being the fraction of monomers that are in the 
same t-tuple in replicas oi, .., Oft . 



o m _ 2 



E ( jfc-l ) s m,felog(s m , fe )+ (B5) 



k=1 



+ s m)1 (log(JV)-l) 



under the assumption that s m .i = 0(1). The total con- 
tribution of n replicas to the entropy S = 5Z m _i <5m to 
leading order in N is now given by 



S^ log(A0 5> m4 = — log(iV) 



where we have used the equalities 
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Rescaling the temperature /3 = f3' y/log(N) , yields the 
following state-dependent part of the quenched free en- 
ergy per particle to leading order 
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This expression has to be maximized with respect to the 
dk with fc even, while it has to be minimized with respect 
to the dk with fc odd. The dk cannot be freely extremised 
as there are constrained from the fact that all group sizes 
s m ,k are positive: e.g. 0<g?2< 1, max(0, 2di— I)<d3<d2, 
etc. 

For (3' < t the entropy dominates and the annealed ap- 
proximation (i.e. dk — 0, Vfc > 1) is exact. For /3' > j- 
the energy starts competing and solutions with d fc > 
can be found. Eventually, the system will com p le tely 
freeze, i.e. dk — 1, Vfc, but then expressions (B5)-(BJ) 
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